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Abstract 

Background: Chinese bayberry {Myrica rubra Sieb. and Zucc.) is a subtropical evergreen tree originating in China. It 
has been cultivated in southern China for several thousand years, and annual production has reached 1.1 million 
tons. The taste and high level of health promoting characters identified in the fruit in recent years has stimulated its 
extension in China and introduction to Australia. A limited number of co-dominant markers have been developed 
and applied in genetic diversity and identity studies. Here we report, for the first time, a survey of whole genome 
shotgun data to develop a large number of simple sequence repeat (SSR) markers to analyse the genetic diversity 
of the common cultivated Chinese bayberry and the relationship with three other Myrica species. 

Results: The whole genome shotgun survey of Chinese bayberry produced 9.01Gb of sequence data, about 26x 
coverage of the estimated genome size of 323 Mb. The genome sequences were highly heterozygous, but with 
little duplication. From the initial assembled scaffold covering 255 Mb sequence data, 28,602 SSRs (>5 repeats) were 
identified. Dinucleotide was the most common repeat motif with a frequency of 84.73%, followed by 13.78% 
trinucleotide, 1.34% tetranucleotide, 0.12% pentanucleotide and 0.04% hexanucleotide. From 600 primer pairs, 186 
polymorphic SSRs were developed. Of these, 158 were used to screen 29 Chinese bayberry accessions and three 
other Myrica species: 91.14%, 89.87% and 46.84% SSRs could be used in Myrica adenophora, Myrica nana and Myrica 
cerifera, respectively. The UPGMA dendrogram tree showed that cultivated Myrica rubra is closely related to Myrica 
adenophora and Myrica nana, originating in southwest China, and very distantly related to Myrica cerifera, 
originating in America. These markers can be used in the construction of a linkage map and for genetic diversity 
studies in Myrica species. 

Conclusion: Myrica rubra has a small genome of about 323 Mb with a high level of heterozygosity. A large number 
of SSRs were identified, and 158 polymorphic SSR markers developed, 91% of which can be transferred to other 
Myrica species. 



Background plants cultivated for fruit [1], growing well on poor soils 

Chinese bayberry is an important commercial horticul- due to the association of nitrogen-fixing bacteria with 

tural crop. It has been cultivated for more than the root system. It is rich in anthocyanins exhibiting a 

7,000 years in southern China, but is little known else- wide range of pharmacological properties, such as anti- 

where. The production area is currently 340,000 ha, with inflammatory, antitumor and antioxidative effects [2]. 

an annual production of 1.1 million tons. The plant is There are four species within the genus Myrica in 

diploid (2n = 16), generally dioecious, with the female China, namely Myrica rubra Sieb. & Zucc, M. esculenta 
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provinces. The best known cultivars are Biqi and 
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Dongkui, both from the Zhejiang province. Although 
there are abundant germplasm resources, studies on 
genetics and breeding of the species are still in their in- 
fancy. Molecular marker technology is a popular tool for 
breeding and genetic research, and with the construction 
of a genomic library, 13 polymorphic microsatellite loci 
have been developed in M. rubra [3] and 11 from an 
expressed sequence tag library [4]. Recently, 12 primer 
pairs have been temporarily developed by ISSR- 
suppression PCR [5] with GSG (GT) 6 as the primer for 
enriching microsatellite sequences. Reports on the gen- 
etic diversity in Chinese bayberry using SSR markers 
have also recently been published [6,7], but the number 
of markers for Chinese bayberry is limited. 

The reproducibility, multiallelicism, co-dominance, 
relative abundance and good genome coverage of SSR 
markers have made them one of the most useful tools 
for genetic diversity and linkage mapping. Genomic 
SSRs and EST-SSRs, considered complementary to plant 
genome mapping, have been reported in many fruit 
crops, such as walnut [8], cherry [9], apricot [10] and 
coconut [11]. EST-SSRs are useful for genetic analysis, 
but their relatively low polymorphism and the high pos- 
sibility of no gene-rich regions in the genome are limita- 
tions to their use. In contrast, genomic SSRs are highly 
polymorphic and tend to be widely distributed through- 
out the genome, resulting in better map coverage [12]. 

With genetic maps serving as the basis for future pos- 
itional gene cloning, making map-based cloning and 
marker-assisted selection possible, the development of 
more SSRs is essential. As sequencing technologies ad- 
vance, whole-genome shotgun (WGS) sequences are be- 
coming increasingly available. These DNA sequences are 
valuable resources for SSR development in many plant 
species, such as rice [13] and papaya [14]. In addition, 
they can be used to evaluate the frequency and distribu- 
tion of different types of SSRs in the genome, and even 
help to estimate genome size and characters such as het- 
erozygosis and repeats. 

As a way of reducing the cost of genotyping research, 
Schuelke [15] proposed a method for fluorescent dye la- 
belling of PCR fragments with a sequence-specific for- 
ward primer: the universal fluorescent-labelled M13(-21) 
primer, at the 5 end, acts as the forward primer in a 
one-tube' reaction. As this method allows for high- 
throughput genetic analyses, with a high number of 
microsatellite markers widely used, we considered the 
possibility of using this approach for multiplex PCR, to 
improve the efficiency and save costs. 

In this study, we mined and validated 158 SSR mar- 
kers and describe their application for understanding 
the genetic relationship among 29 Chinese bayberry 
accessions and other Myrica species. These markers 
are useful for genotyping and genetic diversity analysis 



and linkage mapping of Myrica rubra and other 
Myrica species. 

Results 

Genome survey using whole genome shotgun data in 
Chinese bayberry 

WGS generated 273,161 (>100 bp) high quality sequence 
reads from two DNA libraries (250 bp and 500 bp) of 
the androphyte individual 'C2010-55'. We used 9.01 G 
raw data for K-mer analysis and heterozygous simula- 
tion. For the 17-mer frequency distribution (Figure 1), 
the peak of the depth distribution was about 22. The 
estimated genome size was 323 Mb, using the formula: 
genome size = k-mer count/peak of the kmer distribu- 
tion. The minor peak at 1/2 altitude of the main peak 
indicates the high level of heterozygosity in this genome 
(Figure 1). A total of 739,969 contigs were assembled 
with a total sequence length of 255.7 Mb. The length of 
N50 was 295 bp in our assembly, and the longest contig 
and scaffold 7,593 and 127,008 bp, respectively. 

Frequency distribution of different types of SSR markers 

A total of 17,172 out of 273,161 scaffolds (6%) retrieved 
from the genome survey sequence contained 28,602 
SSRs (Table 1), of which 5,401 contained more than 
one SSR, and 1,444 SSRs were present in compound 
format. Among the derived SSR repeats, the di- 
nucleotide was the most abundant repeat, accounting 
for 84.72% of total SSRs, followed by tri- (13.78%), 
tetra- (1.34%), penta- (0.12%), and hexa- (0.04%) 
nucleotides (Table 1). There was a large proportion of 
both dinucleotides and trinucleotides while the rest 
amounted to less 2%. The average frequency of occur- 
rence was about 10.47% (Table 1). 

The SSR frequency of each motif is presented in 
Additional file 1. The SSR motif consists of 69 types. 
Among the repeat motifs of the dinucleotide, the 
AG/CT repeat was the most common, representing 
53.72%, followed by 39.20% AT repeats (Figure 2), 
and the predominant motifs of trinucleotide (AAG/ 
CTT and AAT/ATT) repeats accounted for 37.15% 
and 32.56%, respectively (Figure 3). 

Polymorphism of SSR markers 

We first designed and synthesised 600 SSR primer pairs 
from those scaffolds more than 2Kb long. The majority 
of SSR loci were dinucleotide repeats (597, 99.5%), and the 
remainder trinucleotide. Initially, the effectiveness of these 
primer pairs was detected in two cultivars (Biqi and 
Dongkui) and M. cerifera through denaturing PAGE 
(Polyacrylamide gel electrophoresis), and 581 (96.8%) of 
these were amplified successfully in Biqi and Dongkui, and 
400 (66.7%) in M. cerifera. The SSR loci (186, 31%) were 
identified as heterozygous loci either in Biqi or in Dongkui. 
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Figure 1 The distribution of 17-mer depth analysis based on whole genome shotgun data in Chinese bayberry. 



Subsequently, they were used to screen 32 accessions, and 
detected an average of 8.25 alleles and from 3 to 15 alleles 
per locus (Table 2). 

The PIC at each locus ranged from 0.256 to 0.883 with 
an average of 0.67 loci. The PCR product size ranged 
from 110 to 274 bp. All the primers produced amplicons 
in agreement with the expected sizes. Most of the SSR 
primers (139 primer pairs) showed significant deviation 
from HW equilibrium (P < 0.05). Partial correlation ana- 
lysis showed that significant positive correlations existed 
between the repeat unit length and PIC (P<0.01, 
r = 0.2747). This showed that these SSRs had high rates 
of transferability for M. adenophora (91.14%) and M. 
nana (89.87%) and low rates for ML cerifera (46.84%), 
indicating that these markers are suitable for genetic 
diversity analyses in other Myrica species. 

One of the objectives of this study was to develop 
potential suitable SSR markers for genetic mapping 
using Biqi and Dongkui as crossing parents. We 
selected 99 heterozygous loci in Biqi and 105 in 

Table 1 Occurrence of SSRs in the Genome Survey of 
Chinese bayberry 

Type Number Proportion in all SSRs (%) Frequency (%) 



Dongkui (Table 3): 135 primer pairs can be used to- 
gether in linkage mapping of the planned F 2 popula- 
tions between Biqi and Dongkui. 

Genetic relationship analysis 

The 32 accessions were divided into three groups 
(A, B and C, Figure 4), based on Nei's genetic dis- 
tance coefficient [16] using UPGMA cluster analysis. 
The similarity among all the accessions varied from 
0.54 to 0.84. At the species level, the UPGMA den- 
drogram produced clusters separating M. nana and 
M. cerifera into two distinct groups. The genetic 



Dinucleotide 24,233 84.72% 

Trinucleotide 3,941 13.78% 

Tetranucleotide 383 1.34% 

Pentanucleotide 35 0.12% 

Hexanucleotide 10 0.04% 

Total 28,602 100% 
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Figure 2 Percentage of different motifs in dinucleotide repeats 
in Chinese bayberry genome. 
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Figure 3 Percentage of different motifs in trinucleotide repeats 
in Chinese bayberry genome. 
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similarity between M. cerifera and M. rubra was 0.54, 
lower than the 0.74 previously reported by Xie [6]. 

The main cluster 'A' included the subgroups A-l and 
A-2. Subgroup A-l includes 16 accessions, mainly from 
the cities of Ningbo (12) and Hangzhou (3), where the 
popular and dominant cultivar is Biqi. This demon- 
strates that these natural elite seedling selections are 
truly distinct from the local cultivars. For their genetic 
relationships (Figure 4), the rare monoecious individual 
(C2010-4) is closely related to Biqi, while the accessions 
'Shuijing' and 'Y2010-72' (both white fruit type) are clearly 
separated in the cluster, with low genetic distance. 

Subgroup A-2 includes 14 accessions, with four from 
Wenzhou, two from Taizhou, and one each from the 
cities of Hangzhou and Ningbo, and the Hunan, 
Guangxi, Guizhou and Jiangsu provinces. This group 
includes the popular cultivar Dongkui. The four acces- 
sions from Wenzhou distributed in this cluster have 
genetic similarity. The accession 'Tongzimei' from the 
Hunan province is on an independent branch, showing 
that it is genetically distinct. 'Xiaolejiangchonghei' and 
'M. adenophord grouped together, and are possibly in 
the same population. Six androphyte accessions, distrib- 
uted in group A, are close to cultivars of the same 
geographic origin. 

The accessions l Myrica nana from Yunnan and 
'Myrica cerifera from the USA were independently 
classified as the 'B' and 'C group, indicating a distant 
relationship with cultivated Myrica rubra. 

Discussion 

Our major aims were to find a large set of SSR markers for 
Myrica rubra and understand the genetic diversity and rela- 
tionship among representative cultivars, androphyate and 
related species. This research paves the way for construct- 
ing an SSR-based linkage map in Myrica. 



The genome characteristics of genus Myrica 

Shotgun sequencing is suitable for simple genomes, with 
no or few repeat sequences, such as Chinese bayberry. 
For such genomes, the genome can largely be assembled 
simply by merging together reads containing overlapping 
sequence [17]. We report the genome survey of Chinese 
bayberry using whole genome shotgun sequencing. 
The 17-nucleotide depth distribution suggests a 
genome size of 323 Mb, larger than peach (220 Mb, 
http://www.rosaceae.org/peach/genome), but close to 
our estimate of 250 Mb from flow cytometry using 
rice as the reference (date not shown). Although the 
highly homozygous material was selected on a lim- 
ited number of SSR loci assays, the actual heterozygous 
rate, as revealed by 185 new SSR markers, was very high 
(63%). To overcome the key issue of heterozygosity and 
allow us to generate a high-quality genome sequence, we 
can use a unique homozygous form such as monoploid, 
derived using tissue culture or from nature and worth 
further study. 

Marker development for under-utilised fruit crops 

SSRs have been widely used for high-throughput 
genotyping and map construction as they have the ad- 
vantage of high abundance, random distribution within 
the genome, high polymorphism information content 
and stable co-dominance [18-20]. They can be developed 
from either genomic or expressed sequence tag (EST) li- 
braries. Although EST-SSRs are useful for genetic ana- 
lysis, their disadvantages of relatively low polymorphism 
and high concentration in gene-rich regions of the gen- 
ome may limit their usage, especially for the construc- 
tion of linkage maps [21]. In this study, a total of 600 
SSR primer pairs were designed from 28,602 SSR sites, 
and 581 (96.8%) primer pairs were effective. Due to the 
self-complementary nature to form dimers, AT/TA is 
not usually used to develop markers [12]. Our findings 
are in agreement with that published for peach, where 
the dinucleotide repeat motifs were also found to be 
the most common, and (CT) n as the most common 
repeat unit [22]. 

In the present study, the mean value of PIC was higher 
than the previously reported 0.62 [7], but the consistent 
relationship was observed between SSR polymorphism 
and repeat unit length. There are some reports of a posi- 
tive relationship between degree of polymorphism and 
repeat unit length [23,24]. However, those polymorphic 
SSRs that are homozygous in both parents cannot be 
mapped in F 2 populations, although they are useful for 
mapping in F 2 or backcross populations [25], while 
heterozygous SSRs can be used for mapping in F 2 
populations (Table 2). The estimated number of SSRs 
that can be mapped in the F 2 populations between Biqi 
and Dongkui was about 85%. 
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Table 2 Characteristics of 158 SSR markers in this study 



Locus GenBank Repeat 

~» : motif 
Accession 



Primer sequence (5-3) 



Size Na Ho He PIC P H w 

range 

(bp) a 



ZJU001 ab JQ318696 (GA)10 
ZJU002 ac JQ318697 (TQ13 
ZJU003 ab JQ318698 (AG) 11 



ZJU004 ac 



ZJU005 al 



ZJU006 at 



ZJU007 at 



ZJU008 ab 



ZJU009 at 



ZJUOKT 



ZJU01 1 a 



ZJU013 al 



ZJU014 at 



ZJU015 31 



ZJU016 at 



ZJU017 ab 



ZJU018 at 



ZJU019 at 



ZJU020 b 



JQ3 18699 
JQ3 18700 
JQ318701 
JQ3 18702 
JQ3 18703 
JQ3 18704 
JQ3 18705 
JQ3 18706 



ZJU012 ab JQ318707 



JQ3 18708 
JQ3 18709 
JQ318710 
JQ318711 
JQ318712 
JQ318713 
JQ318714 
JQ318715 



ZJU021 3 JQ318716 



(GA)10 
(AG) 14 
(GA)10 
(AG) 13 

(CD 10 
(CD 10 
(CDii 

(GA)10 
(CD 12 

(CD io 

(AG) 15 
(G A) 1 1 
(TC) 10 
(CD13 
(CD15 
(GA)12 
(AG) 10 
(TG)10 



F:<NED > <Tail-1 > CCTCTCCACCCATGAGAAAC 

RCAAATCATTCCCTGCTTTCC 

F:<NED > <Tail-1 > TCAAAGAGACGTOTGGCAG 

R:TCCGCTCACAGACAGAGAGA 

F:<NED > <Tail-1 > GTCACCITGCTCITC^GGC 

R :TCC1TGTAC1TG^CTGCTGG A 

F:<NED > <Tail-1 > AACAGAACCATCGTCAAGGC 

R:GGTACAGTCGCTCCGGmA 

F:<NED > <Tail-1 > CmGGACATGGCAACACAC 

R:TCCACmGACAGATOCCA 

F:<NED > <Tail-1 > CTCGCCCTCTCTCTCTACCA 

R:AGmATCCACCCGTGTCGT 

F:<NED > <Tail-1 > TGATCCATOGAACCATGTG 

RTCAGTOATGGTGCAGAAGC 

F:<NED > <Tail-1 > GGAGAAATGAACGGTGGAGA 

R:TCCAAAGCTAATACCCACGC 

F:<NED > <Tail-1 > AATOTCGCAAGTAGTCGCC 

R:ATATCAACCCATGGGAGCAA 

F:<NED > <Tail-1 > TGCAACATCGAAATOGAAA 

RiATGCCGGCAAGTCmGTGT 

F:<NED > <Tail-1 > GGAGGCTCGTCAGTCATCTC 

r : ™gcgtccotctctctcg 

F:<NED > <Tail-1 > OTCACTCACCGCCmCTC 

R:AATGGCCTCCACATCTCAAG 

F:<NED > <Tail-1 > ACTTGTCATTCCCACGTTCC 

RCACTCCATCTCAACCACCCT 

F:<NED > <Tail-1 > TGGAATGTCGATCTGAAACAA 

RiACCAGCTOTACGACGGTGG 

F:<NED > <Tail-1 > TOGTGTGGTGGTAATGGTG 

R:AAATAATGCAAGCAGGTGGG 

F:<NED > <Tail-1 > CCGTOACTATOCCCAG^ 

R:GGCAAmCCAAATCGCTAA 

F:<NED > <Tail-1 > ACTGAAGAACCAAACGTGGG 

R:GGTGTGmCTCTGTGTGCG 

F:<NED > <Tail-1 > ACGAAAmGACCAATCGCT 

R:AGGGmOTCTGGTOGGT 

F:<NED > <Tail-1 > mCATAACCCGTOGOTC 

R:AAGGTGGAAACGTGTCAAGG 

F:<NED > <Tail-1 > CACAGGACATGTGATGGAGG 

RCCATCCTGAGCTTTGTCGAT 

F:<NED > <Tail-1 > TCGCCAGCTOCTAATGTCT 

R:GAGCGCATGTOTOCTAAA 



160-188 7 0.1667 0.4271 0.4002 0.0000 



219-229 4 0.2083 0.5257 0.4572 0.0005 



203-217 8 0.7407 0.8344 0.7949 0.0003 



204-210 4 0.3571 0.7325 0.6704 0.0003 



200-228 11 0.3000 0.8679 0.8291 0.0000 



193-205 5 0.2593 0.3305 0.3089 0.0000 



193-209 8 0.5625 0.6617 0.6302 0.U 



191-215 10 0.7931 0.7973 0.7563 0.0002 



207-221 5 0.0741 0.3599 0.3371 0.0000 



181-205 9 0.9032 0.8012 0.7614 0.0000 



200-216 9 0.2692 0.7926 0.7554 0.0000 



184-218 13 0.5000 0.8571 0.8251 0.0000 



211-221 6 0.4444 0.5199 0.4515 0.0094 



186-212 13 0.6875 0.9033 0.8791 0.0251 



199-221 6 0.6154 0.6614 0.5902 0.0585 



196-216 11 0.6333 0.8469 0.8130 0.0179 



180-200 6 0.6250 0.7093 0.6518 0.0003 



196-216 7 0.1429 0.7189 0.6667 0.0000 



209-219 6 0.2800 0.6865 0.6317 0.0000 



201-213 7 0.5172 0.7453 0.6983 0.0000 



190-212 8 0.7778 0.7428 0.7025 0.0663 
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Table 2 Characteristics of 158 SSR markers in this study (Continued) 



ZJU022 ab JQ318717 (GA)10 



ZJU023 a< 



ZJU024 at 



ZJU025 31 



ZJU026 at 



ZJU027 



ZJU028 at 



ZJU030 at 



ZJU031 3 



ZJU032 al 



ZJU033 al 



ZJU034 at 



ZJU035 al 



JQ318718 
JQ318719 
JQ3 18720 
JQ318721 
JQ3 18722 
JQ3 18723 



ZJU029 ab JQ318724 



JQ3 18725 
JQ3 18726 
JQ318727 
JQ3 18728 
JQ3 18729 
JQ3 18730 



(AG) 15 
(TC) 10 
(TC) 10 
(TQ 10 
(TTQ8 
(AG) 10 
(AG) 10 
(CA)13 
(GA)16 
(TQ11 
(TQ10 
(CT)10 
(GA)14 



ZJU036 ab JQ318731 (GA)10 

ZJU037 ab JQ318732 (TQ10 

ZJU038 b JQ318733 (AG) 10 

ZJU039 3 JQ318734 (CT)10 

ZJU040 JQ318735 (TQ16 

ZJU041 ab JQ318736 (TQ11 

ZJU042 ab JQ318737 (TQ10 

ZJU043 b JQ318738 (CT)10 



F:<NED > <Tail-1 > AAGCTOAGCAAGCGTCGAG 

RTGCGAAGGGAAATTTCAGAC 

F:<NED > <Tail-1 > GTGmGGGCAGCACCTA^ 

R:AAAGAGTACAACAACGCGGG 

F:<NED > <Tail-1 > CCGCATGmGATOATGTC 

R:GCGTTGAGCGGAGAGATTAC 

F:<NED > <Tail-1 > mGAGCGATAGTACGGAGG 

R:ATATGCTACGTOGTOCCC 

F:<NED > <Tail-1 > CCAGACAGGTOGCCACCAT 

RGCCTCTGGATCTCGATOCG 

F:<NED > <Tail-1 > GTOCAAmGCCTCCAm 

RGGTGCCTATACTGCCAGCTC 

F:<NED > <Tail-1 > CAACCATCCAAACCAAATCC 

RTCTACCAATCGTGGCTAGGG 

F:<NED > <Tail-1 > TOTCCGGGATGTCTACAGG 

R:CAACAGCAATCGCAAAGAAA 

F:<NED > <Tail-1 > AAGTGAGCTCTCCCTCCCTC 

R:CACCGAAATAOTGCCGm 

F:<NED > <Tail-1 > GCACAGGAACACCAGGATCT 

R:CCAAGCCCTAATOCCmC 

F:<NED > <Tail-1 > ATOCCACGTOGTOAGAC 

RGATGCCTAACTCCGAATCCA 

F:<NED > <Tail-1 > GCACAAGTOCTGACATGCT 

R:AGTOCATOAACCCACACA 

F:<NED > <Tail-1 > ATGGGAATGTGGAGAACGAG 

R:GGTTGCrrCTTTGCTTTGG 

F:<NED > <Tail-1 > TOGATCCTGGTOCOTCG 

R:AAACTGCATGCATGGTTCCT 

F:<NED > <Tail-1 > CTGCCACTCTOCTGGCCTC 

R:ATGTGCCCAATOTGACTCC 

F:<NED > <Tail-1 > GTGAmCCCTCCCATOAC 

R:ACGAAGCGGGAAGTAGGA^ 

F:<NED > <Tail-1 > CTOTGGCCCGmGTAACC 

R:AACGATOCmAAGCGGAA 

F:<NED > <Tail-1 > AAACGAAAGTGGGCGTATO 

R:CACCAGTGCGTCCTATGAGA 

F:<NED > <Tail-1 > AAACTCCGTGCTGGAATCAA 

R:GCAGACAAGCOTCCTGTTC 

F:<PET ><Tail-2 > TGATCACCmCAGTOGCA 

R:CACATOGCAGAGTCCTGAA 

F:<PETxTail-2 > AGGAmCTCCAGAGGGACG 

R:GGTTCCGCATCAAACTACAAA 

F:<PETxTail-2 > AAACCGAGCTCTCCTAAGCC 

R:CTCGCAAmCTCGGGATAC 



188-208 9 0.6923 0.8575 0.8227 0.0109 



200-226 14 0.6667 0.8840 0.8544 0.0251 



180-196 6 0.6000 0.7345 0.6716 0.1624 



216-234 8 0.2667 0.7537 0.7044 0.0000 



200-220 10 0.4545 0.8573 0.8199 0.0000 



203-227 6 0.3125 0.6250 0.5321 0.0003 



164-170 4 0.1724 0.2789 0.2566 0.0000 



189-205 6 0.5312 0.6925 0.6296 0, 



193-205 7 0.4286 0.7208 0.6676 0.0000 



179-195 8 0.8387 0.7948 0.7492 0.0000 



204-226 



0.6786 0.6442 0.5852 0.0220 



195-207 6 0.0690 0.6655 0.5897 0.0000 



191-209 8 0.4138 0.7762 0.7250 0.0000 



201-217 8 0.1290 0.7425 0.6900 0.0000 



186-214 8 0.3333 0.5895 0.5516 0.0000 



208-228 9 0.8125 0.7867 0.7429 0.0135 



194-200 4 0.2273 0.5106 0.4646 0.0007 



219-229 6 0.3077 0.6161 0.5745 0.0004 



198-220 10 0.3182 0.8192 0.7798 0.0000 



226-244 5 0.2258 0.3199 0.3031 0.0000 



220-242 5 0.3571 0.5331 0, 



0.0000 



225-245 4 0.5714 0.6383 0.5667 0.2655 
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Table 2 Characteristics of 158 SSR markers in this study (Continued) 



ZJU044 ab JQ318739 (GA)12 

ZJU045 ab JQ318740 (CD 10 

ZJU046 ab JQ318741 (AG) 10 

ZJU047 ab JQ318742 (GA)13 

ZJU048 ab JQ318743 (CD 14 



ZJU049 at 



ZJU050 at 



ZJU052 al 



ZJU053 al 



ZJU054 ab 



ZJU055 31 



ZJU056 at 



ZJU057 at 



ZJU058 at 



ZJU059 b 



ZJU061 a 



ZJU062 



ZJU063 al 



ZJU064 b 



ZJU065 a< 



JQ3 18744 
JQ3 18745 



ZJU051 ab JQ318746 



JQ3 18747 
JQ3 18748 
JQ3 18749 
JQ318750 
JQ3 18751 
JQ318752 
JQ318753 
JQ3 18754 



ZJU060 ab JQ318755 



JQ3 18756 
JQ318757 
JQ318758 
JQ318759 
JQ3 18760 



(GAA)8 

(AG) 11 

(AG) 12 

(AG) 16 

(AG) 10 

(CT)13 

(CT)10 

(GA)13 

(CT)10 

(GD10 

(TC) 14 

(G^8(GA)9 

(TQ11 

(TQ10 

(TC) 12 

(GA)10 

(CA)13 



F:<PETxTail-2 > GATGGTGGOTGTOTGG^ 

R:AAGTGGGACGTCAATOCTG 

F:<PET><Tail-2 > GAGAGAGGGAGAGAGGCCAT 

RGGAAGATTCATGGGAGAGGG 

F:<PET ><Tail-2 > TOCTGTAAGCATCGCAATC 

R:AAGCTCCGGTAACACACACC 

F:<PETxTail-2 >TTCGATCATTCATGAGGCTG 

RfTTAATTGCATCCCGGATTTC 

F:<PETxTail-2 > AGCGGACCGAGTOTAGAGA 

RGCAACCCTACAAAGCGAGAG 

F:<PETxTail-2 > GTGTCTGCAGCAACTOCAC 

R:GTCGGAACCGAAGATGGTTA 

F:<PETxTail-2 > GTCACAGCCTGGATAGCTCC 

R:GTCTCTCCTGGATGAGCTGC 

F:<PETxTail-2 > AGAGAAAGACCGGGACCAAT 

R:GAGAAATAAAGCCGAGCGTG 

F:<PETxTail-2 > CCCGAGCTGAACGAAATAGA 

R:GGATCAAAGCGTOTCGm 

F:<PETxTail-2 > AAATCCGAAACACCTCTCCC 

R:ATGTGGAGACTOCACTGGG 

F:<PET > <Tail-2 > TOAmGCmGTGCAmG 

RGAAACTACCGTGCCCAACAT 

F:<PET> <Tail-2 > TTATGGGTTTCATTGGGCAG 

RTCACCAGGCTACTGCATGTC 

F:<PETxTail-2 > GACAAAGTGGGTGCCATOT 

R:TGCATGGTCCTTTCTTCCT 

F:<PET> <Tail-2 > TCATGTGGAGATTGAAGCCA 

RCGTCCCGAATGAAGATTTGT 

F:<PET ><Tail-2 > TCCGGAGCmCAATCTCAT 

R:GCCTACGAACTCAGGTTCCA 

F:<PET > <Tail-2 > TGmGmOTGCTAmCCATC 

R:GACAGTTCCCACCAGCATTT 

F:<PETxTail-2 > TGGCCAGGAACTTTGTATCC 

RGAAAGATTGGGAATGCTGGA 

F:<PETxTail-2 >TTTGGAGGAAGCAAACAAGC 

RTCCTGCGCCAACAATCTAAT 

F:<PETxTail-2 > GTCGAGAGAACAAAGCGACC 

R:GTCCAATGCCGCACTAAOT 

F:<PETxTail-2 > ACTCAGCAGGACCACCAACT 

r : ™gacggaaatoggotg 

F:<PETxTail-2 > ACCATGAAGCTGACCTGGAG 

R:mCGTGGTCCCACCTACTC 

F:<PET> <Tail-2 > TCCAGAATATCATCTCTTGCCA 

R: ATATOCTAACGTGTGCGGG 



235-255 8 0.2500 0.5091 0.4853 0.0000 



228-258 13 0.6129 0.8821 0.8544 0.0007 



226-242 7 0.3871 0.6256 0.5824 0.0000 



247-259 7 0.7097 0.7615 0.7074 0.0019 



230-254 12 0.2903 0.8493 0.8166 0.0000 



234-267 10 0.8125 0.7262 0.6797 0.0000 



233-245 7 0.3000 0.7288 0.6916 0.0000 



229-233 3 0.4333 0.4198 0.3594 0.0012 



230-248 9 0.4348 0.8628 0.8261 0.0000 



222-240 8 0.5000 0.5655 0.5211 0.0001 



232-250 9 0.3000 0.8667 0.8268 0.0003 



238-254 6 0.2500 0.7006 0.6357 0.0000 



230-246 7 0.5714 0.7643 0.7122 0.0068 



230-244 6 0.1579 0.6814 0.6283 0.0000 



252-274 11 0.7500 0.8274 0.7900 0.8036 



217-235 7 0.5200 0.7935 0.7505 0.0016 



223-243 7 0.6562 0.8110 0.7691 0.0000 



204-232 11 0.2812 0.7922 0.7506 0.0000 



240-252 7 0.2400 0.3282 0.3135 0.0004 



232-250 10 0.7000 0.8593 0.8270 0.1320 



226-244 6 0.4348 0.7256 0.6666 0.0001 



214-236 9 0.6333 0.7706 0.7219 0.0001 
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Table 2 Characteristics of 158 SSR markers in this study (Continued) 



ZJU066 ab JQ318761 (GA)10 



ZJU067 al 



ZJU068 at 



ZJU069 b ' 



ZJU070 ab 



ZJU071 a 



ZJU072 al 



ZJU074 at 



ZJU075 b 



ZJU076 ab 



ZJU077 at 



ZJU078 at 



ZJU079 ab 



ZJU080 at 



ZJU081 a 



JQ3 18762 
JQ3 18763 
JQ3 18764 
JQ3 18765 
JQ3 18766 
JQ3 18767 



ZJU073 a JQ318768 



JQ3 18769 
JQ3 18770 
JQ318771 
JQ3 18772 
JQ3 18773 
JQ3 18774 
JQ3 18775 
JQ3 18776 



ZJU082 ab JQ318777 



(CD 10 

(CT)10 

(GA)10 

(CT)11 

(GA)10 

(AG) 10 

(AG) 12 

(CT)15 

(CT)17 

(AG)9 

(AQ9 

(TC) 10 

(TC) 13 

(CT)9 

(GA)8 

(CT) 10 



ZJU083 ab JQ318778 (AG) 10 

ZJU084 ab JQ318779 (AG)9 

ZJU085 ab JQ318780 (AG)9 

ZJU086 ab JQ318781 (TQ10 

ZJU087 ab JQ318782 (GA)9 



F:<PETxTail-2 > CmCCOTGTCGCmCAG 

R:GGTCGCAGATCAGGTCAAGT 

F:<PETxTail-2 > CAGACAGCGAGGAGACAACA 

R:GGTCmCGAACmGCTCG 

F:<PETxTail-2 > GAAGCTAAACGCCAGAAACG 

R:ACTCCTCACACGAATGGGTC 

F:<PET ><Tail-2 > TGCCATAATCCTGAGAGCCT 

R:TGTTCTGTAATGGCGTGGAA 

F:<PETxTail-2 > GTGCTCGAGATGTCCTCCAT 

R:ACAATCCCATCGCATACAGG 

F:<PETxTail-2 > CTAAGGTOGTCCCTGTCCA 

R:CTTGTGTGGTGATGGTTTGG 

F:<PETxTail-2 > AGTCAGCGTGGGAATGTACC 

R:mCAGAACAAGTOGTCGC 

F:<PET ><Tail-2 > TACGCCAAGATCCAAAGACC 

RJCTCGAGTTGAGTTTGGGCT 

F:<PET ><Tail-2 > TGCAGAGGAACTGGTGACTG 

R:GAGAAGGCTCAGTGGGTGAG 

F:<PETxTail-2 > AATAAACACACAGGGCGAGG 

R:ATCGGGCAGACCAGAATATG 

F:<FAM > <Tail-3 > ATGGTOCCGACGTCCTCTG 

R:AGTGCAGAGTGCGAGATCAA 

F:<FAM > <Tail-3 >mGGAATOAACAACAmAGAC 

RTGCAGCOTGCTCCTCTTAT 

F:<FAM > <Tail-3 > ACACCACGGTTOTCGATTC 

R:GTAACGAGGCTCTOCTOC 

F:<FAM > <Tail-3 > AAGGCTAGACCGCAATCTGA 

R:GGGCAACAG^TAC1TCCCA 

F:<FAM > <Tail-3 > OTGACGAAATGCAGACGAA 

RTCCGGATCAGGGTCAAATAG 

F:<FAM > <Tail-3 > TGCTC1TGCAGAGAGTCGAG 

R:TCATAATACCCTTGGCAAACA 

F:<FAM > <Tail-3 > TATATCGAATCCCAAAGGCG 

R:AAGATATOGTCCGGCTCCT 

F:<FAM > <Tail-3 > TAGCOTGGAGAmAGGGC 

R:TOAAAmCGCAGCCTC^ 

F:<FAM > <Tail-3 > mCGATOGTGGTCTGTGA 

R^ATOACTOACmGmATOGG 

F:<FAM > <Tail-3 > GCmAACCGAGTGATGGGA 

RTAAAGGAGCGCTGGAAAGAA 

F:<FAM > <Tail-3 > TCCTCTCmCACACTOCGA 

R:GGTCGATCAmCTCTCCCA 

F:<FAM > <Tail-3 > CGAGTGTAGCTAGGAACGGC 

R:AATOGACCTGCAAATCTCG 



221-235 8 0.2593 0.6450 0.6075 0.0000 



217-263 11 0.6923 0.8273 0.7861 0.0070 



227-239 6 0.2917 0.7535 0.6913 0.0000 



224-258 8 0.2609 0.5594 0.5235 0.0004 



221-247 7 0.5200 0.7861 0.7364 0.0000 



228-234 3 0.3704 0.6157 0.5305 0.0110 



223-237 7 0.5625 0.7604 0.7117 0.0000 



222-242 7 0.2105 0.7568 0.7087 0.0000 



215-239 10 0.5517 0.8234 0.7831 0.0007 



239-255 9 0.0769 0.8650 0.8307 0.0000 



131-169 11 0.8438 0.8353 0.8034 0.0000 



137-153 8 0.2000 0.6590 0.6079 0.0000 



130-146 6 0.5500 0.7513 0.6881 0.1339 



116-134 9 0.8438 0.8596 0.8291 0.0008 



124-134 5 0.2903 0.3411 0.3172 0.0103 



137-157 6 0.5517 0.5820 0.5379 0.0003 



129-141 5 0.3438 0.4043 0.3792 0.0169 



133-157 11 0.8667 0.8960 0.8692 0.0000 



124-138 6 0.1379 0.5197 0.4766 0.0000 



150-184 8 0.6875 0.5992 0.5383 0.6352 



118-152 13 0.9062 0.8720 0.8445 0.0005 



135-149 8 0.4688 0.7748 0.7273 0.0204 
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Table 2 Characteristics of 158 SSR markers in this study (Continued) 



ZJU088 ab JQ318783 (CT)9 



ZJU089 ab 



ZJU090 at 



ZJU09f 



ZJU092 al 



ZJU093 al 



ZJU094 at 



ZJU096 at 



ZJU097 at 



ZJU098 ab 



ZJU099 at 



ZJU100 at 



ZJU101 ab 



JQ3 18784 
JQ3 18785 
JQ3 18786 
JQ3 18787 
JQ3 18788 
JQ3 18789 



ZJU095 ab JQ318790 



JQ318791 
JQ3 18792 
JQ3 18793 
JQ3 18794 
JQ3 18795 
JQ3 18796 



(GA)8 

(AG)9 

(TC) 15 

(TG)10 

(GA)10 

(CT)10 

(AG)9 

(CD 10 

(AG) 10 

(CT)9 

(GA)10 

(TC)9 

(AG)9 



ZJU102 ab JQ3 18797 (GA)10 

ZJU103 ab JQ318798 (AG)9 

ZJU104 ab JQ3 18799 (TA)9 

ZJU105 ab JQ3 18800 (G A) 1 1 

ZJU106 ab JQ318801 (GA)8 

ZJU107 ab JQ3 18802 (TQ10 

ZJU108 b JQ3 18803 (CT)9 

ZJU109 ab JQ3 18804 (TQ10 



F:<FAM > <Tail-3 > GAGCTCCGAACTTCTTCCCr 

R:CTTCTCCACAGGACTCTGCC 

F:<FAM > <Tail-3 > CGTOGGATOGGGAACAGA 

RGAGGGCTAATGTGGACCAGT 

F:<FAM > <Tail-3 > GGAAATCTCCGAATGTGATCC 

R:TGGTGGATGAACCACTCAAA 

F:<FAM > <Tail-3 > AAAGAGCACACAGCCCTAGC 

R:GGCAGTGTCGCAGTGATAGA 

F:<FAM > <Tail-3 > CTCTOCCGACCTCATO^ 

RGGGGACTCGCATAAATCACT 

F:<FAM > <Tail-3 > ATGCCATGTOCATGAGTGT 

R:TATCCCGTAAGCAATCAGGG 

F:<FAM > <Tail-3 > ATCACGGGTOTGCTGTOT 

R:CAGAAGAAGCCATTTCTGCC 

F:<FAM > <Tail-3 > TACCCACCGTACCAAAGGTC 

R:GAATGAACCCAGGCGATAGA 

F:<FAM > <Tail-3 > CATACTGCAATGCATCTCCC 

RTCAAmGTGTGCCCTOCG 

F:<FAM > <Tail-3 > AATOTOGGGAGGGCTCGT 

R:TGCGTOTGGAGACCAmA 

F:<FAM > <Tail-3 > GACGCTCCATCTCTGGTCTC 

RGCCAAACCGCACTAGAGAAA 

F:<FAM > <Tail-3 > TOTOCACTOTGGGTGAT 

R:AACTACAAACAGCCCAACCG 

F:<FAM > <Tail-3 > ACTOTCCGGATOCACAAC 

R:TCAAGGCACACAATAATGCAA 

F:<FAM > <Tail-3 > TGATOAGCTGCCAACAAAG 

R:mAACAmGGCACCGACA 

F:<FAM > <Tail-3 > GAACCACGAACTOAACCGT 

RiAACCACCAAACTOGCTCCA 

F:<FAM > <Tail-3 > TGAGGAGGGAGTOAGTOG 

R :GCGTC1TCCICCTCC1TC1T 

F:<FAM > <Tail-3 > ACGTGGCAGTOAGTOTO 

R:TCAGATCTCCGTOGAGC^ 

F:<FAM > <Tail-3 > TGAGAAACGCAGCAAGAGAA 

RGATCTCTCCCAAGCATCCTC 

F:<FAM > <Tail-3 > GCAGTCGGATAGAGAGACGG 

R:TGTOATCAAACACACCGAGA 

F:<FAM > <Tail-3 > TGGTGTCACGATOACTGGT 

R:CTGCATGTAATGGCAGTTCAA 

F:<FAM > <Tail-3 > TOGTAGTGCACTGCAGGAG 

R:CGAGGGTCGAGTTCAGAGAG 

F:<FAM > <Tail-3 > TCCGCTCTCCTCTCTGTCTC 

R:GTGAGTOTGCTGCTGCAAT 



126-150 13 0.9677 0.8773 0.8490 0.0053 



138-152 7 0.8065 0.7382 0.6778 0.0000 



118-134 8 0.2903 0.6642 0.6089 0.0000 



124-146 10 0.4615 0.8695 0.8358 0.0012 



127-151 11 0.6875 0.8264 0.7916 0.0041 



130-156 12 0.9355 0.8662 0.8367 0 



124-150 10 0.9062 0.8646 0.8332 0.0000 



114-130 7 0.4839 0.7070 0.6420 0.0004 



126-154 13 0.8000 0.8757 0.8479 0.0310 



118-134 8 0.8438 0.7778 0.7297 0.0009 



145-167 10 0.9355 0.8831 0.8549 0.0483 



130-150 9 0.7742 0.7763 0.7299 0.0000 



128-154 5 0.8333 0.6316 0.5629 0.2930 



134-154 7 0.6667 0.7062 0.6527 0.8110 



118-132 8 0.4231 0.5890 0.5441 0.0111 



121-139 10 0.7097 0.7731 0.7359 0.0003 



114-128 6 0.3704 0.6296 0.5702 0.1383 



135-157 11 0.5806 0.8165 0.7801 0.0000 



134-146 7 0.3636 0.7717 0.7203 0.0000 



114-130 



0.4375 0.5322 0.5012 0.3632 



132-160 13 0.3929 0.8253 0.7909 0.0000 



138-164 11 0.8000 0.8441 0.8082 0.0003 
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Table 2 Characteristics of 158 SSR markers in this study (Continued) 



ZJU110 ab JQ318805 (AG)9 

ZJU111 ab JQ3 18806 (TC)8 

ZJU112 ac JQ318807 (GA)8 

ZJU113 ab JQ318808 (AG)9 

ZJU114 3 JQ3 18809 (GA)10 

ZJU1 15 ab JQ318810 (AG) 14 

ZJU116 ab JQ318811 (AG) 15 

ZJU117 ab JQ318812 (A AG) 9 

ZJU1 18 ab JQ318813 (CT)9 

ZJU119 3 JQ318814 (AG) 11 

ZJU120 ab JQ318815 (GA)8 

ZJU121 3 JQ318816 (CD 11 

ZJU122 ab JQ318817 (TC)8 

ZJU123 ab JQ318818 (CT)8 

ZJU124 ab JQ318819 (CD 10 

ZJU125 ab JQ3 18820 (TC)8 

ZJU126 ab JQ3 18821 (GA)10 

ZJU127 JQ3 18822 (GQ10 

ZJU128 ab JQ3 18823 (AG) 14 

ZJU129 ab JQ3 18824 (CD 10 

ZJU130 ab JQ318825 (GA)8 

ZJU131 ab JQ318826 (CD 14 



F:<FAM > <Tail-3 > TOCACGGTGGTAGCTGTAG 

R:ACTGTGGTCCGTCGAACTCT 

F:<FAM > <Tail-3 > mCTAATGTOTOGCCCA 

RTCATCTCOTGCAGATCCC 

F:<FAM > <Tail-3 > GGAGAGTGAGAGATCGCAGC 

RGGCAACACCCTCAGTATCGT 

F:<FAM > <Tail-3 > AAACGCACCAGAGAAAGACG 

RTCCATCTCTGGTCTCCATCC 

F:<FAM > <Tail-3 > CTAGAGCGCTCCACGATACC 

R:AGAACGGTGGAGAATCGAA 

F:<FAM > <Tail-3 > GGTCTGAGGCOTCACTCTG 

RGAGACCCAATAACCCATCCA 

F:<FAM > <Tail-3 > CmCTCCGTCTGCTCCATC 

R:GTCCAAACTTGGAGCCCATA 

F:<FAM > <Tail-3 > TCTCAGATCCCTCCACGTO 

RCCACTGGATCAGGACAACCT 

F:<FAM > <Tail-3 > CAAGCCACGTGCATACCTATT 

RCAGCTGGCTTCTAACTGCAA 

F:<FAM > <Tail-3 > C^TCGACITCAGAGGCAGC 

R:TCCCTCTCAAACmGCCAC 

F:<HEX > <Tail-4 > TOGmCGmGCAAGTCA 

RGTCATCCATCCAATCCATCC 

F:<HEX > <Tail-4 > AATCACCGAAGAAATCCACG 

R : ATTGCCCTCCCTTCTGTTCr 

F:<HEX > <Tail-4 > TGACGGAAGGATACTGGCTC 

RCCATCAGACATGGCTTTCCT 

F:<HEX > <Tail-4 > TGAATOTOGGTOCCTGG 

RfTGCTTCAGTTCCAAACGAAA 

F:<HEX > <Tail-4 > GTGGCAGCCTCTCTATCGTC 

R:ATGACGTACTGCCC1TGC^ 

F:<HEX > <Tail-4 > TAAGGGCAGTCAGACCAACC 

RCTGCAGCCTACAATGATCCA 

F:<HEX > <Tail-4 > CCAATGTGGACAGGTGTGAG 

RGGCAGTAGTCGCTTCCCATA 

F:<HEX > <Tail-4 > AGGATCOTGTCACCACCAG 

R : A ATTCTTCGTCCC AGCCTC 

F:<HEX > <Tail-4 > CCCAATOACACAAATOCC 

RfTTGGCATAGCATTGTTCGTC 

F:<HEX > <Tail-4 > GAGGTGCAATOCGTGGOT 

R:TCAAGCATCAGCTGCTCAGT 

F:<HEX > <Tail-4 > GATOCATGCACCAAATCAC 

RGAATGTCCACGACGTGAATG 

F:<HEX > <Tail-4 > TOAGAATCACAAACGCCTG 

RGGTGGGTGAAATGCCTAGAA 



143-159 5 0.7667 0.6486 0.5844 0.0000 



122-136 5 0.9000 0.5701 0.4652 0.0000 



133-147 8 0.4839 0.6557 0.6212 0.0009 



138-154 6 0.6667 0.6588 0.5987 0.0130 



132-160 12 0.8214 0.8740 0.8448 0.0388 



126-156 14 0.9677 0.9022 0.8775 0.0068 



110-136 13 0.6875 0.8199 0.7846 0.0001 



118-133 6 0.4688 0.6944 0.6426 0.0000 



120-144 11 0.8750 0.8502 0.8171 0.0001 



136-152 9 0.4828 0.8348 0.7975 0.0000 



164-180 6 0.9355 0.7012 0.6354 0.0073 



164-186 11 0.8621 0.8705 0.8426 0.0000 



164-180 7 0.7742 0.7509 0.7012 0.0000 



172-176 3 0.4667 0.6367 0.5499 0.2152 



161-187 12 0.9355 0.8778 0.8498 0.0001 



164-186 4 0.2188 0.3884 0.3453 0.0000 



173-193 11 0.9677 0.8535 0.8228 0.0000 



165-189 11 0.9259 0.8288 0.7900 0.0079 



145-161 5 0.4194 0.5019 0.4496 0.1981 



161-189 10 0.7500 0.8031 0.7611 0.0234 



160-176 5 0.3478 0.4599 0.4131 0.2852 



153-187 13 0.8710 0.8990 0.8735 0.0009 



Jiao et al. BMC Genomics 2012, 13:201 
http://www.biomedcentral.eom/1 471 -21 64/1 3/201 



Page 11 of 16 



Table 2 Characteristics of 158 SSR markers in this study (Continued) 



ZJU132 ab JQ3 18827 (CD 11 

ZJU133 ab JQ3 18828 (TQ11 

ZJU134 ab JQ3 18829 (G A) 1 1 

ZJU135 ab JQ318830 (AG) 10 

ZJU136 ab JQ3 18831 (GA)10 

ZJU137 ab JQ318832 (TC)8 

ZJU138 ab JQ3 18833 (CD 10 

ZJU139 ab JQ3 18834 (GA)12 

ZJU140 b JQ3 18835 (CD 14 

ZJU141 ab JQ318836 (CD 13 

ZJU142 ab JQ318837 (TQ13 

ZJU143 ab JQ3 18838 (CD 12 

ZJU144 ab JQ3 18839 (AG) 12 

ZJU145 ab JQ3 18840 (CD 10 

ZJU146 ab JQ318841 (AG) 10 

ZJU147 ab JQ3 18842 (AG) 10 

ZJU148 ab JQ3 18843 (AG) 18 

ZJU149 ab JQ3 18844 (TC)8 

ZJU150 ab JQ318845 (AG) 10 

ZJU151 ab JQ3 18846 (CA)9 

ZJU152 ab JQ318847 (AG) 11 

ZJU153 ab JQ318848 (AG) 10 



F:<HEX > <Tail-4 > AGGCACCmCmCCTCTC 

RCAAGGAAGGAGGTGACGAAG 

F:<HEX > <Tail-4 > GCCCTGCAGTCmGTCAAT 

RCAGCTTGCAGTGTTCATTCA 

F:<HEX > <Tail-4 > AGTGCCCAAGCATGACTOT 

R:AATCAGTOTCCGAGGATGG 

F:<HEX > <Tail-4 > AAmACGGCTGTCCGTGAG 

RCCTTGGGGTCATGAACATT 

F:<HEX > <Tail-4 > TCCCACAGATCTCTAGCCGT 

RCGCTCAGTTCTTAATTTCTTACTGTC 

F:<HEX > <Tail-4 > TGGATOTGCTGCAGTOTC 

R:AGCTAGCACTGGCCTAACCA 

F:<HEX > <Tail-4 > GCACAGTOAGTOTGGGCA 

R:CTCTTTCAAATCCACGCACA 

F:<HEX > <Tail-4 > CCGAGCITCG^AGGACITG 

RCCAACAATACCCGAACCATC 

F:<HEX > <Tail-4 > TGTGCTCATOTGGATCCTG 

R:ACATCAGC1TGCATCCCTCT 

F:<HEX > <Tail-4 > CACAATCAGCTGCAGAATCAA 

R:AATGGCCGOTGCAATATAA 

F:<HEX > <Tail-4 > CATOACCTCCmCGCAAT 

R:ATCCAACGGCTCAAAGAATG 

F:<HEX > <Tail-4 > GTAGAGTAGATGCGCCTCGG 

R:ACGTACGAGCCATACACACG 

F:<HEX > <Tail-4 > GCCACTC1TCCCTCAACGTA 

RCAGGTCAGTCCTGATGGGAT 

F:<HEX > <Tail-4 > TGTGGCTGTGTOCTCCATA 

RCAATGTTGGGTGCTTTGTTG 

F:<HEX > <Tail-4 > TGGAAACmGTCGTGTGGA 

r : ™tatcgggcagccagaac 

F:<HEX > <Tail-4 >™GGAACCAAACTGGACGG 

RTCAAATGCCGTGCTATOAG 

F:<HEX > <Tail-4 > AAGAGCAGGAACCGAACOT 

R:ACCGAAAGACGAAGAAACGA 

F:<HEX > <Tail-4 > AGCCCTCCATGTGTGC™T 

R:AGGGAGAGAGTGGTOTGCC 

F:<HEX > <Tail-4 > ACTOACTGAGAGGCTGCGG 

RGTGGAAACCGAACGTCCTAA 

F:<HEX > <Tail-4 > GAATOGAAATCCCTAGCCC 

RCATTTGCGCATGTCTCCTTA 

F:<HEX > <Tail-4 > AAACGAAGTCGTOAATGCC 

R:GTGATTTGGGCCTTCGATA 

F:<HEX > <Tail-4 > CCAGCTCCGAATOGCAAAC 

R:GTGGCGGTTTATCTCATCGT 



164-178 5 0.6452 0.6568 0.5834 0.6586 



171-195 8 0.8710 0.7731 0.7267 0.0000 



172-190 8 0.9688 0.7907 0.7507 0.0004 



173-191 10 0.9688 0.7966 0.7557 0.0000 



173-201 13 0.7742 0.8953 0.8692 0.0004 



140-168 12 0.1875 0.6930 0.6612 0.0000 



152-170 8 0.3333 0.7746 0.7261 0.0001 



138-164 6 0.3667 0.4418 0.4043 0.0000 



172-198 9 0.6538 0.6139 0.5474 0.0000 



175-201 11 0.6774 0.7996 0.7600 0.0002 



166-184 9 0.6774 0.6912 0.6498 0.0231 



181-197 7 0.6923 0.7044 0.6397 0.0000 



148-164 7 0.5161 0.6864 0.6252 0.0430 



155-175 7 0.6875 0.7351 0.6912 0.0000 



154-168 6 0.2258 0.6663 0.6187 0.0000 



173-195 10 0.8333 0.7169 0.6811 0.0005 



160-190 15 0.9375 0.9067 0.8829 0.4973 



139-163 11 0.8333 0.8718 0.8417 0.0022 



163-201 10 0.9000 0.8469 0.8123 0.0053 



156-170 6 0.3750 0.5511 0.5188 0.0001 



163-181 7 0.9355 0.7578 0.7040 0.0161 



173-191 6 1.0000 0.6667 0.5927 0.0000 
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Table 2 Characteristics of 158 SSR markers in this study (Continued) 



ZJU154 ab 


JQ3 18849 


(AG) 11 


F:<HEX > <Tail-4 > TTGTCAATTGCCC1TCC1TC 
R:TOCTCCCmCCCAOTCT 


156-184 


10 


0.9333 


0.6847 


0.6184 


0.0000 


ZJU155 ab 


JQ3 18850 


(TC)9 


F:<HEX > <Tail-4 > GAGAGCAATCAGTGAAGCCC 
R:GGGAGACGGATGTCGATTTA 


160-188 


8 


0.8438 


0.6731 


0.6037 


0.0000 


ZJU156 ab 


JQ3 18851 


(TA)8 


F:<HEX > <Tail-4 > ATACGTCGAAAGATCCACCG 
R TTCTGG A ATCCTTCCC ATTG 


166-184 


7 


0.5484 


0.6626 


0.6063 


0.0000 


ZJU157 ab 


JQ3 18852 


(AG)9 


F:<HEX > <Tail-4 > CACTCACAACCAAAGCCAGA 
R:GTGCATAATCACAGGCATGA 


154-186 


13 


0.9677 


0.9064 


0.8823 


0.0171 


ZJU158 ab 


JQ3 18853 


(AT) 10 


F:<HEX > <Tail-4 > CCAGATGATCACGCAGCTTA 
RGGTCCTCCAATACGTTCCTC 


156-174 


9 


0.6452 


0.8292 


0.7917 


0.0000 


Mean 










8.25 


0.5636 


0.7178 


0.6730 





Note: a b c These SSRs are transferable for M. adenophora, M. nana and M. cerifera, respectively. SSR markers are listed according to ascending order in different 
fluorescent dyes. Shown for each primer pair are the repeat motif, primer sequences, size range (bp), number of alleles detected {Na), observed heterozygosity 
(Ho), expected heterozygosity (He), polymorphism information content (PIC) and Chi-square test for Hardy-Weinberg equilibrium (P H w)- The annealing temperature 
was 60 °C; a, including length of tail sequences (18 bp total). P H w over 0.05 are underlined. 



Recently, based on mass sequence data of Chinese 
bayberry obtained by RNA-Seq, 41,239 UniGenes were 
identified and approximately 80% of the UniGenes 
(32,805) were annotated, which provides an excellent 
platform for future EST-SSR development and functional 
genomic research [26]. 

High efficient test methods 

Normally, a universal Ml 3 primer is labelled with one of 
a number of fluorescent dyes. The tailed primer provides 
a complementary sequence to the fluorescent labelled 
Ml 3 primer, leading to the amplification of fluorescent 
PCR products, and then the PCR products of different 
sizes and/or labelled with different fluorescent dyes are 
mixed and tested [27]. In this research, a multiplex PCR 
strategy was designed using the universal M13-tailed 
primer and three additional tail primers, designed 
arbitrarily, in presumed four-plex amplifications in sin- 
gle PCR, for a major reduction in cost and time. However, 
as only a few primer combinations were successful, 

Table 3 Distribution of the segregation types expected 
for the mapping population (Biqi x Dongkui) 



Segregation type 


Alleles 


Number 


Mapping in F n 


aaxaa 


1 


12 


No 


aax bb 


2 


11 


No 


aaxab 


2 


24 


Yes 


abxaa 


2 


18 


Yes 


abxab 


2 


8 


Yes 


aax be 


3 


12 


Yes 


abxec 


3 


12 


Yes 


abxac 


3 


41 


Yes 


abxed 


4 


20 


Yes 


Total 




135 





most resulting in weak bands, we did the PCR indi- 
vidually and mixed the PCR products. Further opti- 
misation of multiplex PCR is needed to evaluate its 
general applicability. 

Evolution of Myrica species 

In this study, both cultivated species and wild species 
were analysed and their genetic diversity could easily be 
differentiated. M. nana and M. cerifera were clearly gen- 
etically distant to M. rubra. M. nana, also known as the 
dwarf or Yunnan arbutus, is indigenous to the Yunnan 
and Guizhou provinces, and has a plant height of < 2 m. 
The juvenile period of fruit tree can be shortened for 
breeding purposes. Studies on embryo culture in vitro of 
the Fi seeds of crosses between M. rubra and M. nana, 
[28], has shown good cross compatibility between M. 
rubra and M. nana, resulting in 70.5% normal seeds 
with intact embryo. M. adenophora and M. nana grow 
as wild trees, with the fruit of M. adenophora only suit- 
able for medical purposes and not edible. 

Our findings on the genetic similarity between M. 
adenophora and M. rubra, which are considered a 
progenitor-derivative species pair, are consistent with a pre- 
viously published figure of 0.897 [29]. An earlier study 
observed little change in allelic diversity along the chrono- 
sequence and no evidence for heterosis, although there was 
a moderate change in genotypic diversity [30]. The markers 
developed in this study can be very useful in future popula- 
tion structure analysis. 

Conclusions 

In summary, the genome size of Myrica genus is small, 
about 320 Mb. A large set of SSRs were developed from 
a genome survey of Myrica rubra. The results suggest 
that they have high rates of transferability, making them 
suitable for use in other Myrica species. 
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Figure 4 Dendrogram for 32 Chinese bayberry accessions derived from UPGMA cluster analysis based on 158 SSR markers. The 

symbols before the accession codes indicate the sex: o, androphyte plant, • , common cultivars, and □, monoecious plant. The numbers are 
bootstrap values based on 1000 iterations. Only bootstrap values larger than 50 are indicated. 



Materials and methods 

Plant materials and genome survey 

We selected an androphyte 'C2010-55' for the genome 
survey because it was the most homozygous (10 out of 
14 SSRs) individual among 230 accessions. Two DNA li- 
braries of 250 and 500 bp insert size were constructed 
and sequenced by Illumina Hi-Seq 2000. 

Twenty-nine accessions of the cultivated species (M 
rubra) and 3 related species (M. adenophora, M. nana, 
M. cerifera), collected from different provinces in China 
(Table 4), were used to evaluate the suitability of the 
SSRs for genetic distance analysis. Young leaves were 
collected and frozen in liquid nitrogen prior to genomic 
DNA extraction using CTAB methods [4]. DNA con- 
centrations were measured spectrophotometrically at 



260 nm, and the extracts electrophoresed on 1% 
agarose to confirm the quality. The purified DNAs 
were standardised at 40 ng/ul and stored at -40°C. 

SSR identification and primer design 

We used MISA scripting language (http://pgrc.ipk-gatersle- 
ben.de/misa/misa.html) to identify microsatellite repeats in 
our sequence database. The SSR loci containing perfect re- 
peat units of 2-6 nucleotides only were considered. The 
minimum SSR length criteria were defined as six reitera- 
tions for dinucleotide, and five reiterations for other repeat 
units. Mononucleotide repeats and complex SSR types were 
excluded from the study. 

The SSR primers were designed using BatchPrimer3 
interface modules (http://pgrc.ipk-gatersleben.de/misa/ 
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primer3.html). We selected 600 primers that met the fol- 
lowing parameters: 110-230 final product length, primer 
size from 18 to 22 bp with an optimum size of 20 bp, 
and the annealing temperature was set at 60°C The 
repeat units over eight were used. 

Tail-1(M13 universal sequence-TGTAAAACGACGG 
CCAGT), Tail- 2 ( CG AG TC AG TATAG G G C AC) , Tail-3 
( ATC ACG A AG C AG ATG C A A) and Tail-4(GAGCAT 
CTCGTACCAGTC) were added to the 5' end of each 
150 forward primers of pairs respectively. Tail-2, Tail-3 
and Tail-4 were designed so that the primer size was 
18 bp and the annealing temperature was 53°C. Primers 



were synthesised by Invitrogen Trading (Shanghai) Co., 
Ltd. We primarily tested two cultivars (Biqi and Dongkui) 
and M. cerifera for 600 SSR loci by PAGE (polyacrylamide 
denaturing gel) to confirm their suitability. Tail-1, Tail-2, 
Tail-3 and Tail-4 labelled with one of the following dyes: 
NED, PET, FAM, and HEX, respectively. 

Polymerase chain reaction and gel electrophoresis 

Each 20 \i\ reaction mixture contained 10 x PCR buffer 
(plus Mg2+), 0.2 mM of each dNTP, 5 pmol of each 
reverse, 4 pmol of the tail primer, 1 pmol of the 
forward primer, 0.5 units of rTaq polymerase (TaKaRa, 



Table 4 The 32 bayberry accessions included in this study 



No. 


Accession 


Fruit/Flower colour 3 


Fruit maturity date 


Region 


1 


Biqi 


black 


Late June 


Cixi, Ningbo, Zhejiang 


2 


Dongkui 


red 


Early July 


Taizhou, Zhejiang 


3 


Dayehuang 


red 


Mid-June 


Hangzhou, Zhejiang 


4 


Dingaomei 


black 


Mid to late June 


Wenzhou, Zhejiang 


5 


Huangshanbai 


white 


Early July 


Hangzhou, Zhejiang 


6 


Jiazhaizao 


black 


Mid-June 


Wenzhou, Zhejiang 


7 


Jianmei 


red 


Late June 


Cixi, Ningbo, Zhejiang 


8 


Muyemei 


black 


Late June 


Jinhua, Zhejiang 


9 


Putaoli 


black 


Mid June 


Hangzhou, Zhejiang 


10 


Shuijing 


white 


Late June/Early July 


Yuyao, Ningbo, Zhejiang 


11 


Tongzimei 


black 


Mid-June 


Hunan 


12 


Wandao 


black 


Early July 


Zhoushan, Zhejiang 


13 


Xiaolejiangchonghei 


black 


May 


Guizhou 


14 


Biqi12 


black 


Late June 


Yuyao, Ningbo, Zhejiang 


15 


Y20 10-70 


red 


Late June/Early July 


Yuyao, Ningbo, Zhejiang 


16 


Y20 10-71 


black 


Mid to late June 


Yuyao, Ningbo, Zhejiang 


17 


Y20 10-72 


white 


Late June/Early July 


Yuyao, Ningbo, Zhejiang 


18 


Y20 10-73 


red 


Late June 


Yuyao, Ningbo, Zhejiang 


19 


Y20 10-74 


red 


Late June/Early July 


Yuyao, Ningbo, Zhejiang 


20 


Y20 10-75 


black 


Late June 


Yuyao, Ningbo, Zhejiang 


21 


Y20 10-76 


white 


Late June/Early July 


Yuyao, Ningbo, Zhejiang 


22 


Y2010-77 


red 


Late June/Early July 


Yuyao, Ningbo, Zhejiang 


23 


C2010-4 


red 


Late June 


Cixi, Ningbo, Zhejiang 


24 


*C2010-55 


red 




Cixi, Ningbo, Zhejiang 


25 


*W2011-1 


yellow- red 




Wenzhou, Zhejiang 


26 


*W2011-5 


red 




Wenzhou, Zhejiang 


27 


*H2011-12 


yellow-green 




Hangzhou, Zhejiang 


28 


*JS2011-16 


red 




Suzhou, Jiangsu 


29 


*T20 11-30 


red 




Taizhou, Zhejiang 


30 


Myrico odenophoro 


red 


February to May 


Guilin, Guangxi 


31 


Myrico nana 


red 


June to July 


Yunnan 


32 


*Myrica cerifera 


yellow-green 




Cixi, Ningbo, Zhejiang 



Note: fruit colour for cultivar and flower colour for androphyte. * selected androphytes. 
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China) and 40 ng genomic DNA template. Each 
primer pair had an interval of 20 bp according to the 
expected size of amplicons. 

DNA amplification was in an Eppendorf Mastercycler 
(Germany) programmed at 94°C for 5 min for initial de- 
naturation, then 32 cycles at 94°C (30 s)/58°C (30 s)/72°C 
(30 s), followed by 8 cycles of 94°C (30 s)/53°C (30 s)/72°C 
(30 s). The final extension step was 10 min at 72°C. Each 
PCR product was run on 1% agarose gel at 110 V for a 
quality check. 

Subsequently, PCR products were electrophoresed on 
8% denaturing PAGE, according to Myers et al. [31], at 
60 W in a Sequi-Gen GT Nucleic Acid electrophoresis 
cell (BioRad) for 4 h, depending on the fragment sizes to 
be separated, and visualised by silver staining [32]. 
Genotypes showing one and two bands were scored as 
homozygous and heterozygous, respectively, and the 
results recorded and photographed. 

Multiplex PCR was designed and tested with products 
of different sizes and labelled with different fluorescent 
dyes. Each 20 \A reaction mixture contained 10 x PCR 
buffer (plus Mg 2+ ), 0.8 mM of each dNTP, 1 unit of rTaq 
polymerase, 40 ng genomic DNA template and a total of 
four primer pairs with 5 pmol of each reverse primer, 4 
pmol of each tail primer, and 1 pmol of each forward 
primer. The PCR products were diluted, mixed with the 
internal size standard LIZ500 (Applied Biosystems) and 
loaded on an ABI 3130 Genetic Analyzer. Alleles were 
scored using GeneMapper version 4.0 software (Applied 
Biosystems, Foster City, CA, USA). 
Data analysis 

The raw genome sequence data was first filtered to ob- 
tain high quality reads, then assembled using SOAP 
(http://soap.genomics.org.cn/soapdenovo.html) denovo 
software to contig, scaffold and fill in gaps. In addition, 
we used SSPACE software to build the scaffold. K-mer 
analysis was to help estimate the genome size and char- 
acters, such as heterozygosis and repeats. 

The number of alleles (A), observed heterozygosity (Ho) 
and expected heterozygosity (He) were calculated using 
POPGENE version 1.32 (http://www.ualberta.ca/~fyeh/pop- 
gene_download.html). Chi-square test for Hardy- Weinberg 
equilibrium allele frequencies and polymorphism informa- 
tion content (PIC) was calculated using PowerMarker 
version 3.25 [33] (http://statgen.ncsu.edu/powermarker/ 
downloads.htm). Microsoft office Excel 2007 was used to 
analyse the correlation. Genetic similarity among all the 
accessions was calculated according to Dices coeffi- 
cients using Nei's coefficient index [16] with the Free 
Tree 0.9.1.50[34] (http://www.natur.cuni.cz/~flegr/ 
programs/freetree.htm) software, and the dendrogram 
constructed using the unweighted pair-group method with 
arithmetic averaging (UPGMA) option. The confidence of 
branch support was then evaluated by bootstrap analysis 



with 1,000 replicates. The dendrogram was printed using 
MEGA version 5.05 software [35] (http://www.megasoft- 
ware.net/mega.php). 

Additional material 



Additional file 1: Occurrence of different SSRs in the genome 
survey of Chinese bayberry. 
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